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Abstract. We consider algebras with one binary operation • and one generator (monogenic) 
and satisfying the left distributive law a ■ (b ■ c) = (a ■ b) ■ (a • c). One can define a sequence 
of finite left-distributive algebras A n , and then take a limit to get an infinite monogenic left- 
distributive algebra Aoo . Results of Laver and Steel assuming a strong large cardinal axiom 
imply that Aoo is free; it is open whether the freeness of Aoo can be proved without the 
large cardinal assumption, or even in Peano arithmetic. The main result of this paper is the 
equivalence of this problem with the existence of a certain algebra of increasing functions on 
natural numbers, called an embedding algebra. Using this and results of the first author, we 
conclude that the freeness of Aoo is unprovable in primitive recursive arithmetic. 



1. Introduction 

We consider algebras with one binary operation • and one generator [monogenic) and 
satisfying the left distributive law a ■ (b ■ c) = (a • b) ■ (a • c); in particular, we look for a 
representation of the free algebra. 

The word problem for the free monogenic left-distributive algebra was solved by Laver [6] 
under the assumption of a large cardinal and subsequently by Dehornoy [4] without such 
an assumption. Laver's result uses elementary embeddings from V\ into V\ under the 'ap- 
plication' operation • defined by j -k = {J a< \j(knV ce ). If there exists such an embedding j 
other than the identity, then the algebra Aj generated by j is free. 

When the embeddings in Aj are restricted to an initial segment of V\ , they form a finite 
monogenic left-distributive algebra [7], and these finite algebras can be described without 
reference to elementary embeddings. In fact, for every n there is a (unique) left-distributive 
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operation * n on the set A' n = {1,2,..., 2 n } such that a * n 1 = a + 1 for all a < 2 n and 
2 n * n l = l. 

There is a natural way of defining a limit of the algebras A' n , and one can ask whether 
Aoo is free. We reduce this problem to a simple (II2) statement of finite combinatorics, and 
show that the answer is affirmative provided there exists a nontrivial elementary embedding 
from V\ into itself. The crucial fact used in the proof is a theorem of Laver and Steel [7] 
on critical points of elementary embeddings. 

It is open whether the freeness of Aoo can be proved without the large cardinal assump- 
tion, or even in Peano arithmetic. The main result of this paper is the equivalence of this 
problem with the existence of a certain algebra of increasing functions on natural numbers. 

We introduce embedding algebras, which are algebras (A, •) of increasing functions a: u — > 
ui endowed with a binary operation •. The axioms for embedding algebras state that the 
operation a ■ b is left distributive and interacts with critical points (the critical point of a 
function is the least number moved by the function) in the expected way. If a (nontrivial) 
embedding algebra A exists, then A^ is free; conversely, we construct an embedding 
algebra under the assumption that A^ is free. 

The first author proved [5] that the critical sequence for a nontrivial elementary embed- 
ding j yields an enumeration of critical points in Aj that grows faster than any primitive 
recursive function. One consequence of the main theorem is that such a fast-growing func- 
tion can be defined under the assumption that Aoo is free. It follows that the freeness 
of Aqq is unprovable in primitive recursive arithmetic. 

2. The free monogenic left-distributive algebra 

We consider algebras with one binary operation • generated by a single generator that 
we denote by the symbol 1. We shall often write ab instead of a ■ 6, and use the convention 
that abc = (ab)c. 

The left distributive law is the equality 

(LD) a(bc) = ab(ac). 

We let W = W_a be the set of all words built up from 1 using the operation •, denote by = 
(or by =a] the equivalence relation on W given by 

a = b iff (LD) |= a = 6, 

and let A = W/= be the free left-distributive algebra on one generator. 

For the rest of this section, let (A, •) be a left-distributive algebra generated by 1. We 
will summarize the relevant known results on such algebras. 

Definition 2.1. We say that a is a left subterm of 6, or a <l b, if, for some ci,. . . ,Cf- 
(k > 0), b = ac\ . . . Cfc. 
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Lemma 2.2. 

(i) If a <l b and b <l c, then a <l c. 

(ii) If a <l b in A, then ca <l cb in A. 

Proof. Part (i) is trivial. For (ii), use distributivity: if b = ac\ . . .c k , then we have cb = 
c(aci . . . Cfc) = ca(cci) . . . (cc k ) . ■ 

Theorem 2.3 (Dehornoy [2]). For all a, 6 G A, either a = b or a <l b or b <l a. 

(This was also proved by Laver [6] under the assumption that <l is irreflexive.) 
The proof of Theorem 2.3 is quite constructive, using several explicit recursive construc- 
tions on words in W. We will outline the proof of this result below. 

Lemma 2.4 (Dehornoy [2]). If the relation <l on A is irreflexive, then (A, •) is free and 
satisfies left cancellation. 

Proof. Let 7r be the canonical homomorphism of the free algebra A onto A. If a ^ b in A, 
then either a <l b or a >l b, so either na <l ^b or na >l itb, so na ^ nb; therefore, n is 
an isomorphism. For left cancellation, if a ^ b in A, then a<L^or6<^aby Theorem 2.3, 
so ca <l cb or cb <l ca, so ca 7^ cb by irreflexivity. ■ 

Theorem 2.5 (Dehornoy [4]). There is an algebra (A, •) on which <l is irreflexive. Con- 
sequently, the free algebra is linearly ordered by <l and satisfies left cancellation. ■ 

Definition 2.6. The depth of a G W is defined recursively as follows: 

depth(l) = 0, 
depth(a&) = max{depth(a), depth(6)} + 1. 

The herringbone u k of depth k is also defined recursively: 

u = 1, 

U k +1 = lltfc. 

One can also define the full word Vk, the maximal word of depth k, by vo = 1 and 
Vk+i = VkVk- Then Vk is equivalent to Uk, because an easy induction shows that lvk = Vk+i- 

Lemma 2.7 (Dehornoy [2, Cor. 2]). If a is a word of depth < k, then auk = itfc+i in A. 

Proof. By induction on the depth of a (for all k simultaneously). For a = 1, this is 
immediate from the definition of Uk- If a has positive depth, then a = be where b and c 
have depth smaller than that of a, and hence < k — 1. Now the induction hypothesis gives 

abu k = ab{au k -i) = a(bu k -i) = au k = u k+1 , 
as desired. ■ 

For a G W, we write a ^>ld b when b results from a by a single application of (LD) 
from left to right (to a subword of a), i.e., replacing x(yz) by xy(xz). We write a — > b if 
there is a sequence ao = a, a\, a2, ■ ■ ■ , a k = b (k > 0) such that ^ld for each i < k. 
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Proposition 2.8 (Dehornoy [1]). There is a mapping d from W to W with the following 
properties: 

(1) a — > da; 

(2) if a ~^ld b, then 6 — > da; 

(3) if a — > b, then da — > db. 

Proof. First define a binary operation (g) on W by recursion on the second argument: 

a <S> 1 = al, 

a® be = (a <S> b)(a <S> c). 

(The effect of a <g> 6 is to distribute a in 6 as many times as possible.) 
Then define d by another recursion: 

01 = 1, 

0(a6) = 0a <g> 06. 

(The word da contains all possible applications of (LD) within a.) 

Now everything used here is (or can be viewed as being) defined by recursion, includ- 
ing — > (in terms of — >ld) and even — >ld- a ^ld b iff either a has the form ai(a2a%) 
and b = (a^) (0103), or a and b have the forms a±a2 and 6162, respectively, and either 
a± — ► £,£> 61 and 02 = 62, or a± = 61 and a^ ^ld 62- One can now prove a sequence of 
statements by straightforward inductions: 

ab — > a ® 6; 
a <g> (6 <g> c) -> (a <g> 6) <g> (a <g> c) ; 
if a — > a', then a <8> 6 — > a' ® 6; 
if 6 — >ip 6', then a <8> 6 -^ld a <S> 6'; (induct on b ^>ld b') 
if b — > 6', then a ® 6 — > a <8> 6'; 
a — > 0a; 
ai«2(aias) — > ai ® 0203; 

if a — >ld 6, then 6 — > da; (induct on a —*ld b) 

if a — > ld 6, then 0a — >■ 06; (induct ona->LD 6) 

if a — > 6, then 0a — > 06. 

This gives the desired properties. ■ 

Lemma 2.9 (Dehornoy [2]). If a <l 6 in W (i.e., a is a left subterm of 6 m W, with no 
use of the distributive law), and 6 — > 6', t/ien i/iere is a /e/fc subterm a' ofb' in W such that 
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Proof. A straightforward induction on the length of the derivation 
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Proof of Theorem 2.3. From Proposition 2.8, it follows that, if a =a b, then a — > d m b, 
whenever m is at least the length of an (LD)-derivation of a = b. Now, let a and b be 
words in W, and choose k so that both words are of depth < k. By Lemma 2.7, we have 
auk = Wfc+i and = it/c+i, so a-u^ — > d m Uk+i and — > d m Uk+i for some m. By 
Lemma 2.9, there are left subterms a' and 6' of d m Uk+i such that a — > a' and 6 — > 6'. 
Since a' and 6' are left subterms of the same word, we have either a' = a' <l or 
b' <l a' in VF; therefore, either a = 6, a <l 6, or 6 <£ a in A ■ 

All of the steps in the proof of Theorem 2.3 are accomplished by explicit recursions 
and inductions (on terms, (LD)-derivations, etc.), and it is easy to see that the recursions 
are in fact primitive recursions (on the depths of terms, the lengths of derivations, etc.). 
Therefore, Theorem 2.3 can be proved in a very basic theory of arithmetic. One such theory 
is Primitive Recursive Arithmetic (PRA), which is formalized in a language containing 
function symbols for all possible function definitions using the constant 0, the successor 
function ', composition, and primitive recursion; it has axioms stating that the function 
symbols satisfy their definitions, and that 0' 7^ 0, and a rule of inference allowing induction 
on quantifier-free formulas. (See Sieg [8] for more details.) This theory is among the 
weakest of the commonly-studied fragments of arithmetic; it is often referred to as the 
formal version of what Hilbert meant by 'finitary reasoning.' It is not hard to show that 
the methods used to prove Theorem 2.3 can be formalized in this theory, so Theorem 2.3 
is provable in PRA. 

Now consider algebras with two binary operations • and o. We use the convention 
ab o c = (ab) o c, a o be = a o (be). Let W-p be the set of all words built up from 1 using 
both operations, and let V be the free algebra on one generator under the equivalence 

a =-p b iff (LL) \= a = b, 

where (LL) is the following set of axioms (Laver [6]): 

a o (6 o c) = (a o b) o c 
(a o b)c = a(bc) 
a(b o c) = abo ac 
a ob = abo a 

Note that (LD) is a consequence of (LL): 

a{bc) = (a o b)c = (ab o a)c = ab(ac). 

The motivation for axioms (LL) comes from large cardinal theory. Let V\ be the col- 
lection of all sets of rank less than A, where A is a limit ordinal. Under the assumption 
that there exists a nontrivial elementary embedding j from V\ to V\, let us consider the 
algebra (Aj, •) generated from j by the operation of application 

j.k= \Jj(knv a ) 



(LL) 
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and the algebra (Pj, ■, o) generated from j by • and composition of embeddings. Laver [6] 
shows, among other things, that (A,, •) and {Pj, ■, o) are respectively the free monogenic 
left-distributive algebra and the free monogenic algebra satisfying axioms (LL). 

Again, we summarize some known facts about the algebras (P, -, o). 

Let P be an algebra with one generator 1 satisfying (LL). Let A C P consist of all 
values in P of words in Wx, A satisfies (LD) and is generated by 1. 

Conversely, one can construct an algebra P from an algebra A. The following construc- 
tion is implicit in Laver [6], and described explicitly in Dehornoy [3, Prop. 2]. 

Proposition 2.10 (Laver, Dehornoy). Any algebra (A, •) satisfying (LD) can be extended 
and expanded to an algebra (P, ■, o) satisfying (LL). 

Proof (sketch). Given (A, •), let P D A be the set of formal compositions of one or more 
elements of A, with two such formal compositions identified if their equality can be deduced 
from associativity of composition and the rule a o b = ab o a. Define o and • for two such 
compositions a\ o ■ ■ • o a n and b\ o ■ ■ ■ o b m by 

(ai o • • • o a n ) o (b\ o • • • o bm) = a\ o • • • o a n o b\ o • • • o b m , 
o • • • o a n ) ■ (6i o • • • o b m ) = ai(. . . (a n (&i)) • • •) ° ' ' • • • (a n (b m )) ...). 

This is well-defined on P and satisfies (LL). ■ 

Note that, if A is generated by 1 using •, then P is generated by 1 using ■ and o. 

Lemma 2.11. Every element of the free (LL)-algebra V can be written in the form a\ o 
■•■oo„ for some ai, . . . , a n G A. 

Proof. Induct on the form of p as a word in W-p. If p = 1, we are done. Otherwise, p has 
the form qr or q o r, where we may assume that q = a± o ■ ■ ■ o a n and r = b\ o ■ ■ ■ o b m with 
bj E A. We then have 

q o r = a± o ■ ■ ■ o a n o b\ o ■ ■ ■ o b m 

and 

qr = ci o • • • o Cm, 

where Cj = a\{a2{. ■ ■ (a n (bj)) ■■■)), so p has the desired form. ■ 

In the following proposition, the left-to-right implication is part of Lemma 3 of Laver [6] , 
while the right-to-left implication uses Lemma 3.2 of that paper. 

Proposition 2.12. Let (P,-,o) be an algebra satisfying (LL) and generated by 1, and let 
(A, •) be the subalgebra of (P, ■) generated by 1. Then P is free (with respect to (LL)) if 
and only if A is free (with respect to (LD)). 

Proof. First, note that each term a G W_a is either 1 or of the (unique) form a±b for 
some b. The same statement can be made about b, and so on; we eventually find that 
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each such a has a unique expression of the form a\{a 2 {. ■ ■ (a n (l)) . . . )) for some n > and 
ai,...,a n G W A . 

The next fact (Laver [6, Lemma 3.2]) we will use is that, if n, m > 1, a^, bj G Wj,, and 
ai (a 2 (. . . (a n (l)) ...))=a bi(b 2 (. . . (6 m (l)) ...)), 

then 

ai ° • • • ° an =p h o • • • o b m . 

It will suffice to show that, if a\{a 2 {. ■ ■ (a n (l)) . . . )) fri(&2(- • • (b m (l)) ■ ■ ■ )), then 

a\ o ■ ■ ■ o a n =-p b\ o ■ ■ ■ o b m , since then one can induct on (LD)-derivations. (Note that an 
application of left distributivity cannot start or finish with the term 1, so no term other 
than 1 is equivalent to 1 under = A .) If ai(a 2 (. . . (a n (l)) . . . )) ^ld bi(b 2 (. ■ ■ (b m (l)) ...)), 
then there are two cases: either the application of left distributivity occurs within a single 
term a^, or it changes ai(cii+i(x)) into aiCii+i(ai(x)) for some i. In the first case, we get 
from cii o • • • o a n to b\ o • • • o b m by applying left distributivity within in the second 
case, we get from ai o • • • o a n to b\ o • • • o b m by replacing o aj + i with aia^i o a^. Both 
of these changes are permitted by (LL), so a± o • • • o a n =-p b\ o • • • o b m . 

We are now ready to show that, if A is free, then P is free. Assume A is free, and let 
p, q G W-p be words such that p = q in P; we must show that p =-p q. By Lemma 2.11, 
there are n, m > 1 and a^, bj G W A such that p =j> a\ o ■ ■ ■ o a n and q =-p bi o • • • o b m . 
Since p = q in P, pi = ql in P, so {a\ o ■ • • o a n ) -1 = (&i o • • • o 6 m ) • 1 in P, so 
ai(a2(. • • (a n (l)) . . . )) = 6i(62(- • • (^m(l)) • • • )) in P and hence in A. Since A is free, we 
have ai(a 2 (. ■ ■ (a n (l)) . . . )) = A bi(b 2 (. . . (b m (l)) ...)). Now the preceding paragraph gives 
cii o • • • o a n =-p bi o • • • o 6 m , so p =-p q, as desired. 

Now assume that P is free; we must show that A is free. To do this, we will show that, 
if a,b G W_a and a b, then a ^ & in A. By Proposition 2.10, there is an algebra P' 
extending the free algebra A which satisfies (LL). Since a 6, we have a ^ b in P', so 
a ^-p b. Since P is free, a 7^ 6 in P and hence in A. Therefore, A is free. ■ 

It is not hard to see that the proof of Propostion 2.12 can be carried out in PRA; one 
merely has to use the proof of Proposition 2.10 rather than the proposition itself when 
showing "if a 6, then a ^-p b." 

Now consider the algebras A, and Pj of elementary embeddings. For each nontrivial 
elementary embedding from V\ to itself, let cr(a) be the critical point of a, the least ordinal 
moved by a. Let V be the set of all critical points of elements of Aj. We note that 

cr(a&) = a(cr(6)), cr(a 06)= min(cr(a), cr(6)). 

Consequently, the critical point of every a G Pj is in T, and every a G Pj maps Y into Y. 
Theorem 2.13 (Laver and Steel [7]). The set Y has order type uj. ■ 
Theorem 2.14 (Laver [7]). For every a, b G Aj, if a ^ b, then 0(7) 7^ 6(7) for some 

7 g r. ■ 

Let kq be the critical point of j, and, for all n, let K n+ i = j(K n ). 
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Lemma 2.15. 

(i) If a e Aj has depth at most n, then a(« n ) = K n+ \. 

(ii) For every a G Pj, there are natural numbers d > and N such that a(n n ) = K n +d 
/or a// n> N . 

Proof, (i) By induction on the depth of a: 

a6(K n ) = a6(a(K n _i)) = a(6(« n _i)) = a(« n ) = /c n+ i. 

(ii) By Lemma 2.11, we have a = a± o ■ ■ ■ o ad for some a±, . . . ,a,d & Aj. ■ 

To conclude this section, we remark that one can adjoin to Pj the identity embedding id. 
The extended algebra still satisfies axioms (LL), as well as these rules: 

id • a = a, a • id = id, a o id = id o a = a. 

3. A SEQUENCE OF FINITE ALGEBRAS 

In this section, we will construct, for each natural number n, an algebra A' n on the set 
{1,2,..., 2 n } with a binary operation * n satisfying the left distributive law. We will then 
construct a second operation o n on this set so that the resulting two-operation algebra P' n 
satisfies (LL). The subscripts on the operations will sometimes be omitted while a fixed n 
is being considered. 

The construction of these algebras is due to Laver; Wehrung proved some additional 
properties of them. The proof of the following theorem has been reconstructed indepen- 
dently by several people, including the authors; the presentation here is similar to that of 
Wehrung [9]. (See also Dehornoy [3, Prop. 7].) 

Theorem 3.1' (mostly Laver). Let n > 0. 

(a) There is a unique left- distributive operation * n on {1,2,..., 2 n } such that 

a * n 1 = a + 1 for all a < 2 n , and 2 n * n 1 = 1. 

(b) There is a unique additional operation o n on {1,2,..., 2 n } such that * n and o n 
satisfy axioms (LL). 

The operation * n is defined by double recursion; a * n b is defined by an outer descending 
recursion on a and an inner ascending recursion on b. The recursive formulas are as follows: 

(3.1a) 2 n * n b = b; 

if a < 2 n , then 



(3.1b) 



a * n 1 = a + 1; 
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if a < 2 n and b < 2 n , then 

(3.1c) a * n (6 + 1) = (a * n b) * n (a + 1). 

In order to see that this is a valid recursion, we must maintain the inductive condition 
(3.2') a* n b>a ifa<2 n . 

This clearly holds for a * n 1. For a * n (b + 1) with a < 2 n , we have a * n b > a by the 
induction hypothesis, so (a * n b) * n (a + 1) has already been defined. If a * n b = 2 n , then 
a* n (6 + 1) = 2 n * n (a + 1) = a + 1 > a; if a* n b < 2 n , then a* n (6+l) = (a* n b)* n (a + 1) > 
a * n b > a. Therefore, (3.2') holds for a * n (b + 1) as well, so the recursion can continue. 

The equations (3.1) can be deduced from left distributivity and the equations a * n 1 = 
a + 1 (a < 2 n ) and 2 n * n 1 = 1. This is obvious for (3.1b); for (3.1c) and (3.1a), we have 

a * n (b + 1) = a * n (b * n 1) = (a * n b) * n (a * n 1) = (a * n 6) * n (a + 1), 
2 n * n b = T * n (1 * n • • • * n 1) = (2" * n 1) * n • • • * n (2 n * n 1) = 1 * n • • • * n 1 = b. 

S v ' ^ ' V v ' 

b times b times b times 

This proves the uniqueness part of Theorem 3.1' (a). 

An easy induction on b shows that the equations (3.1) hold even when a = 2 n , if we 
treat addition as being modulo 2 n . (Since we are working with the set {1, 2, . . . , 2 n }, 
it will be convenient to treat reduction modulo 2 n as a mapping into this set; we will 
write u x mod' 2 n " to mean the unique member of {1, 2, . . . , 2 n } which is congruent to x 
modulo 2 n . In particular, mod' 2 n will be 2 n .) We will soon show that the equations 
also hold for b = 2 n , and prove several other useful properties of A' n at the same time. 

For any fixed a, consider the sequence a * n 1, a * n 2, . . . , a * n 2 n in A' n . If a = 2 n , this 
sequence is just 1,2,..., 2 n . If a < 2 n , then the sequence begins with a + 1, and (by (3.1c)) 
each member is obtained from its predecessor by operating on the right by a + 1; hence, 
by (3.2'), the sequence must be strictly increasing as long as its members remain below 2 n . 
Once 2 n is reached (as must happen in at most 2 n — a steps), the next member will be 
a + 1 again, and the sequence repeats. Therefore, the sequence a * n 1, a * n 2, . . . , a * n 2 n 
is periodic (as long as it lasts); each period is strictly increasing from a + 1 to 2 n . We will 
refer to the number of terms in each period of this sequence as the period of a in A' n . (The 
period of 2 n in A' n is 2 n .) 

Proposition 3.2. 

(a) The period of any a in A' n is a power of 2; equivalently, a * n 2 n = 2 n for all a. 

(b) The formulas (3.1) hold modulo 2 n in A' n even when a or b is 2 n . 

(c) Reduction modulo 2 n is a homomorphism from A' n+1 to A' n : 

(a * n+ i b) mod' 2 n = (a mod' 2 n ) * n (b mod' 2 n ) 



for all a, b in A' r 
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(d) For any a < 2 n in A' n , ifp is the period of a in A' n , then the period of a + 2 n in A' n+1 
is also p, and the period of a in A' n+1 is either p or 2p. The period of2 n in A' n+1 is 2 n . 

Proof. By simultaneous induction on n. Part (a) for n = is trivial. 

Suppose (a) holds for n. We noted before that the formulas (3.1) hold modulo 2 n when 
a = 2 n . If a < 2 n but b = 2 n , then (6 + 1) mod' 2 n = 1 and a* n l = a+1, while a* n b = 2 n 
by (a), and 2 n * n (a + 1) = a + 1, so (3.1c) holds even in this case. Therefore, (b) holds 
for n. 

Part (c) for n is proved by induction, downward on a and upward on 6, as in the 
definition of * n +i- If a = 2 n+1 , then both sides are equal to b mod' 2 n . If b = 1, then both 
sides are equal to (a + 1) mod' 2 n . If a < 2 n+1 and b > 1, then the left side is equal to 

((a mod' 2 n ) * n ((b - 1) mod' 2 n )) * n ((a + 1) mod' 2 n ) 

by the induction hypothesis, and the right side is also equal to this value by (b) . Therefore, 
(c) holds for n. 

Next, consider (d). Clearly the period of 2 n+1 in A' n+l is 2 n+1 , twice the period of 2 n 
in A' n . Now suppose a < 2 n , and let p be the period of a in A' n . By (c), for each b 
in A' n , a * n+1 b and (a + 2 n ) * n+ i b must each be equal to either a * n b or (a * n b) + 2 n ; if 
a * n b < 2 n , then both of these values are less than 2 n+1 . It follows that the periods of a 
and a + 2 n in A' n+1 are at least p. Furthermore, by (3.2'), we must have (a + 2 n ) b > 
a + 2 n , so (a + 2 n ) * n +i b must be equal to (a * n 6) + 2 n for all such 6, so, in particular, 
(a + 2 n ) * n+ i p = 2 n+1 ; hence, the period of a + 2 n in A' n+1 is exactly p. (The same 
argument shows that the period of 2 n in A' n+1 is 2 n .) For the period of a in A' n+1 , there 
are two cases. If a * n +i P = 2 n+1 , then the period of a in A' n+1 is p, and we are done. If 
not, a* n+ ip must be 2 n . Then a* n+1 (p + 1) must be either a + 1 or a + 1 + 2 n by (c), and 
it must be greater than 2 n because a * n +i b increases with b until it reaches 2 n+1 , so we 
must have a * n +i (p+1) = a + 1 + 2 n = (a * n 1) + 2 n . Similarly, using part (c) along with 
(3.1c) and (3.2'), we see that a * n +i (p + b) = (a * n b) + 2 n successively for b = 2, 3, ... ,p. 
In particular, a * n+1 b < 2 n+1 for b < 2p and a * n+1 2p = 2 n+1 , so the period of a in A' n+1 
is 2p. This completes the proof of (d) for n. 

Finally, (a) for n + 1 (in the first phrasing) follows immediately from (a) and (d) for n. 
This completes the induction. ■ 

Given these properties of A' n , the proof that the left distributive law holds in A' n is a 
straightforward triple induction (downward on a and 6, upward on c): 

T * (b * c) = b * c = (2 n * b) * (2 n * c); 
a * (2 n * c) = a * c = 2 n * (a * c) = (a * 2 n ) * (a * c); 

if a, b < 2 n , then 

a * (b * 1) = a * (b + 1) = (a * b) * (a + 1) = (a * b) * (a * 1); 
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and, furthermore, if c < 2 n , then 

a * (b * (c + 1)) = a * ((b * c) * (b + 1)) 

= (a * (b * c)) * (a * (b + 1)) [6 * c > b] 

= ((a * 6) * (a * c)) * ((a * 6) * (a + 1)) 

= (a * b) * ((a * c) * (a + 1)) [a*b > a] 

= (a * 6) * (a * (c + 1)). 

We now want to define a second operation o = o n so that the resulting algebra 

P; = ({l,2,...,2-},* n ,o n ) 

satisfies Laver's axioms (LL). In particular, it will have to be true that (ci o n b) * n 1 — 
*n (b * n 1); therefore, we must define 

a o n b = (a * n (b + 1)) - 1, 

where the addition and subtraction are performed modulo 2 n . (So we immediately get 
the uniqueness in Theorem 3.1'(b).) This definition makes it immediate that reduction 
modulo 2 n is a homomorphism from P^ +1 to P' n . We now proceed to prove the four laws 
(LL). All addition and subtraction below is modulo 2 n . 
First, one can show that 2" o j = j o 2" = j as follows: 

2 n oi = 2"*(x + l)-l = (x + l)-l = x, 
x o 2 n = x * (2 n + 1) - 1 = (x * 1) - 1 = x. 

The proof of (a o b) * c = a * (b * c) is by induction on c: 

(a o b) * 1 = (a o b) + 1 = a * (b + 1) = a * (b * 1); 

(a o b) * (c + 1) = ((a o 6) * c) * ((a o b) + 1) = (a * (b * c)) * (a * (6 + 1)) 

= a * ((b * c) * (b + 1)) = a * (b * (c + 1)). 

Next, aob=(a*b)oa because 

(a o 6) + 1 = a * (b + 1) = (a * b) * (a + 1) = ((a * 6) o a) + 1. 
The proof of the associative law a o (b o c) = (a o 6) o c is as follows: 

(a o (6 o c )) + 1 = a * ((6 o c) + 1) 

= a * (b * (c + 1)) 

= a * ((6 * c) * (6 + 1)) 

= (a * (b * c)) * (a * (b + 1)) 

= ((a o 6) * c) * ((a o 6) + 1) 

= (a oft) * (c+ 1) 

= ((a o 6) o c ) + 1. 



12 



RANDALL DOUGHERTY AND THOMAS JECH 



Finally, to prove that a * (b o c) = (a * b) o (a * c), proceed by induction downward on b. 
For b = 2 n , we have 

a * (2 n o c) = a * c = 2 n o (a * c) = (a * 2 n ) o (a * c). 

If b < 2 n , then 



a * (6 o c) = a * ((6 * c) o 6) 

= (a * (b * c)) o (a * b) [b * c > b] 

= ((a * b) * (a * c)) o (a * 6) 
= (a * 6) o (a * c). 

This completes the proof that P' n satisfies (LL), so Theorem 3.1' is proved. 
The following fact will be useful later: 

(3.3') if a^2 n or b^2 n , then aob^2 n . 

This is proved by cases. If a ^ 2 n , then a * (b + 1) > a by (3.2'), so a * (b + 1) ^ 1, so 
a o 6 ^ 2 n . If a = 2 n but 6 ^ 2 n , then a o 6 = 6 ^ 2 n . 

We remark that Theorem 3.1' can be rephrased slightly, replacing 2 n by 0: 

Theorem 3.1 (same credits as for 3.1'). There are unique operations * n and o n on A n = 
Pn = {0, 1, . . . , 2 n — 1} such that the axioms (LL) hold and, for all a G P n , 

a * n 1 = a + 1 mod 2 n . 



This has no effect on the structure of the algebras, but it affects statements referring to 
the ordering of the elements of the algebra. In particular, (3.2') and (3.3') become: 

(3.2) either a * n b = or a * n b > a; 

(3.3) if a ^ or 6^0, then oo&^0. 

Also, the ordinary mod operation now gives the homomorphism from P n +i to P n . 

The element (or 2 n ) of the algebra plays the role that the identity embedding played 
at the end of section 2: 



0*a = a, a*0 = 0, ao0 = 0oa = a. 
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4. The limit algebras and P^ 

Using the finite algebras (A n , * n ) and (P n ,*n,°n), we construct monogenic algebras 
(Aqo, •) and (Poo, •, °)- Let W_a C W-p be the sets of words built up from 1 using • and us- 
ing •, o, respectively. The set of all positive integers can be embedded in W A by identifying 
each positive integer with a word in W^, by recursion: 

1 = 1, 
a + 1 = a ■ 1. 

We also adjoin to Wp, letting W£ = W v U {0} and W\ = W A U {0}, and add rules 

• a = a, a • = 0, ao0 = 0oa = a. 

For every word a G W-p and every n > 0, let [a] n be the value of a in P n = {0, 1, . . . , 2 n — 1}, 
and consider the equivalence relation defined by: 

a =oo b iff [a] n = [b] n for all n > 0. 

We let and Poo be, respectively, the quotients by =00 of W A and W-p. Clearly A^ 
and Poo are generated by 1; also, they satisfy (LD) and (LL), respectively, because A n 
and P n do. (In fact, an equivalent definition for A^ and P^ is that they are the subalgebras 
generated by 1 of the inverse limits of the algebras A n and P n , respectively.) Moreover, 
Aoo Q Poo- We shall investigate the possibility that A^ or P^ is free. 

Lemma 4.1. Por every a G Wp and every n, [a] n +i is either [a] n or [a] n + 2 n . 

Proof. This follows immediately from the fact that reduction modulo 2 n is a homomor- 
phism from P n+ i to P n . ■ 

Note that, as a corollary, if [a] n 7^ 0, then [a] n +i 7^ 0. 

Definition 4.2. Let a G W-p be such that [a] n 7^ for some n. The signature s(a) of a is 
the largest n such that [a] n = 0. 

By Lemma 4.1, for each n > s(a), 2 s ( a ' ) is the largest power of 2 which divides [a] n . 

Lemma 4.3. Let a, 6 G Wp be such that [b] n 7^ for some n. Then, for every n > 0, 

[ab] n = iff [a- 2 s ^] n = 0. 

Proof. If [a • 2 s ^] n = 0, then [a] n * n [2 s ^] n = 0, so 2 S ^ is a multiple of the period of [a] n 
in P n . But [6] n is a multiple of 2 S ( 6 ), so [a] n * n [b] n = 0, so [ab] n = 0. 

On the other hand, suppose [ab] n = 0; then [ [b] n = 0. If s(6) > n, then [2 s ^] n = 0, 
so [a] n * n [2 s ( & )] n = 0. If s(b) < n, let q be the period of [a] n in A n ; then q divides [6] n , 
and since o is a power of 2, g divides the largest power of 2 dividing [6] n , which is 2 s ( b ). 
This again gives [o]n *n 

[2 s(b) ] n = 0. Hence, in either case, [a ■ 2 s{b) ] n = 0. ■ 
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Corollary. s(ab) = s(a ■ 2 S ^). 

Theorem 4.4. The following are equivalent: 

(i) (Ax,, •) is free. 

(ii) (Poo, •, o) is free. 

(iii) satisfies the left cancellation law. 

(iv) <l on is irreflexive. 

(v) If a <l b in A^, then [a] n < [b] n for all but finitely many n. 

(vi) For every a € W^, there is an n such that [a] n 7^ 0. 

(vii) For every k > 0, there is an n such that [uk] n 0- 

(viii) For every k > 1, there is an n such that [1 • k] n 7^ 0. 

Proof. 

(i)<->(ii): Proposition 2.12. 
(i)— >(viii): Assume that, for all n, [1 • k] n = 0. Then, in each A n , 1 * (k + 1) = (1 * k) * 2 = 
0*2 = 2 = 1*1. However, it is easy to see that the word 1 ■ 1 is inequivalent in 
the free algebra to any other word, because no application of the distributive law 
can start from or result in 1-1. Therefore, A is not free, 
(viii) — > (vii): By induction on k > 0, we prove that [uk] n 7^ for some n. Assume that this is 
true for k, and let s = s(v,f-) be the signature of u\~. Let n be such that [1 • 2 s ] n 7^ 0. 
By Lemma 4.3, we have [itfc+i] n = [1 • Uk]n 7^ 0. 
(vii)— >(vi): Let k be the depth of a. We show that, if [a] n = 0, then [u] n = 0, where u = uu- 
By Lemma 2.7, au = Uk+i = uu. If [a] n = 0, then [au] n = * [u] n = [u] n ; since 
[u] n * [u] n is either or >[u] n by formula (3.2), we have [u] n = 0. 
(vi)-^(v): If [a] m 7^ for some m, then [a] n 7^ for all n > m. Suppose a <l b, say 
b = aci...Ck- Let n be sufficiently large that [a] n 7^ 0, [aci] n 7^ 0, [aciC2] n 7^ 
0, . . . , [b] n ^ 0. By (3.2), we have [a] n < [aci] n < • • • < [b] n . 
(v)— >(iv): Trivial, 
(iv)— >(iii): Lemma 2.4. 

(hi)— >( viii): As for (i)— >(viii), if [1 • k] n = for all n, then 1 • (k + 1) = 1 • 1 in A^, violating 
left cancellation, 
(iv)— >(i): Lemma 2.4. 



All of the steps here can be formalized in primitive recursive arithmetic, so Theorem 4.4 
is a theorem of PRA. 



5. Embedding algebras 

In this section, we consider algebras of increasing functions from w to w which imitate 
the behavior of the algebra of elementary embeddings from Laver [6] when restricted to 
the set of critical points. The existence of such algebras will turn out to be equivalent to 
the properties in Theorem 4.4. Moreover, this equivalence can be proved (and formulated) 
in primitive recursive arithmetic. 
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Let id be the identity function on u. If f:ui — > u> is strictly increasing and different 
from id, let cr(/) be the least n such that f(n) > n (the critical point of /). 

Definition 5.1. An embedding algebra is a structure (A,-) where A is a collection of 
strictly increasing functions from u to u>, ■ is a left-distributive binary operation on A, 
and, for every a, 6 G A with b ^ id, cr(a • b) = a(cr(6)). 

As usual, we will often write ab instead of a ■ b. The set A need not contain the identity 
function, but, if it does not, one can extend the operation • to AU {id} in the obvious way: 
a ■ id = id, id • a = a. 

An embedding algebra A is nontrivial if it has an element other than id. Note that 
the set of non-identity elements of A is closed under •: if b has a critical point, so does 
a ■ b. Also, if A is nontrivial, then A has infinitely many critical points: if n = cr(a), then 
o(n) = cr(aa) and a(n) > n. 

The main goal of the next three sections will be to prove the following theorem. 

Theorem 5.2. The statement "There exists a nontrivial embedding algebra" is equivalent 
to the statement "Aoo is free". 

When proving that u A (yD is free" implies the existence of an embedding algebra, we shall 
see that there is a natural way of associating increasing functions from w to w with words 
in Wj_. However, it is not easy to prove that inequivalent words yield distinct functions. 
Here we shall rely on Theorem 2.14, but first we have to develop techniques to 'miniaturize' 
Laver's proof. This will be done in Section 6. In order to develop the necessary machinery, 
we first define a different kind of 'embedding algebra.' The new definition will include 
much of Laver's machinery explicitly; the resulting structure will be much less concrete 
but more amenable to algebraic manipulation. 

Definition 5.3. A two-sorted embedding algebra consists of a nonempty set 8 (the 'em- 
beddings,' for which we will use variables a,b, . . .) and a nonempty set O (the 'ordinals,' 
for which we will use variables a, (3, . . .), together with binary operations • and o on S, a 
binary relation < on O, a constant id G 8, an application operation a,/? i— > a((3) (which 
will often be written without parentheses) from 8 x O to O, a function cr:8— {id} — > O, 
and a ternary relation = C 8 x O x 8, satisfying the following axioms: 

• The relation < is a linear ordering of O. 

• Embeddings are strictly increasing monotone functions: 

(3 < 7 implies a(3 < 07, and a(3 > (3. 

• For all a 7^ id, a(cr(a)) > cr(a). 

• The operation o represents composition: (a o 6)7 = 0(67). 

• The constant id represents the identity: 

id (7) =7, a ■ id = id, and id • a = a o id = id o a = a. 

• The axioms (LL) hold. 
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• For each 7, = 7 is an equivalence relation on £ which respects • and o (i.e., if a = 7 a' 
and b = 7 then a ■ b = 7 a' ■ V and a o b = 7 a' o b'). 

• If 7 < <5 and a = s 6, then a = 7 6. 

• If a = 7 6 and ad < 7, then a5 = bS. 

• For any a 7^ id, a = cr ( a ) id. 

• Coherence: a = 7 b implies ca = C7 c&. 

It follows from these axioms that the operation ■ distributes over itself and application: 

a(bc) = ab(ac) and 0(67) = ab(a^). 
A few more properties also follow easily: 

Proposition 5.4. In a two-sorted embedding algebra, if a and b are embeddings different 
from id, then: 

(1) cr(a) is the least ordinal moved by a; 

(2) cr(a6) = a(cr(6)); and 

(3) cr(a o b) = min(cr(a), cr(6)) . 

Proof. It is given that a(cr(a)) > cr(a); if (3 < cr(a), then the fact that id = cr ( a ) a implies 
that (3 = id(/3) = a{(3). Since id = cr ( 6 ) 6, coherence gives id = a ■ id = a ( cr ( 6 )) ab, so 
ab does not move any ordinal less than a(cr(6)); but it moves a(cr(6)) to a6(a(cr(6))) = 
a(b(cr(b))) > a(cr(6)), so we must have cr(a6) = a(cr(6)). For (3), let 7 = min(cr(a), cr(6)). 
Then, since = 7 respects o, we have id = 7 a o 6, while (a o 6)7 = a(b"f) > max(a7, 67) > 7, 
so 7 is the least ordinal moved by a o b. ■ 

It is easy to verify that all of the axioms in Definition 5.3 are preserved when one 
moves to a substructure (replacing £ and O with smaller sets closed under the operations, 
and restricting the operations and relations accordingly). In particular, if one keeps the 
same S but replaces O with the range of the function cr (assuming that £ 7^ {id}), then 
Proposition 5.4(2) implies that the new sets are closed under the operations, so one obtains 
a new two-sorted embedding algebra in which every ordinal is a critical point. 

If desired, one can restrict £ to the embeddings obtained from a single embedding j id 
using • and o, along with id; this gives a two-sorted embedding algebra generated by a single 
embedding. From now on, we will call a two-sorted embedding algebra monogenic if its 
non-identity embeddings are generated from a single non-identity embedding via • and o. 
Similarly, an embedding algebra is monogenic if it is generated from a single non-identity 
embedding via •; any nontrivial embedding algebra has monogenic subalgebras. Note that 
a monogenic embedding algebra does not contain the identity function. 

The results of Laver [6] show that one can make the set of all elementary embeddings 
from V\ to itself into a two-sorted embedding algebra by letting O be the set of limit 
ordinals less than A and defining = 7 to be = (as defined in Laver [6], Section 2). We now 
want to show that just the simple properties of embedding algebras suffice to construct 
the more elaborate apparatus of a two-sorted embedding algebra. 
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Proposition 5.5. If a nontrivial embedding algebra exists, then there exists a two-sorted 
embedding algebra in which the ordinals have order type ui. 

Proof. Let such an embedding algebra be given; we will construct a two-sorted embedding 
algebra. The ordinal set O will be the set of critical points from the given algebra; this is 
an infinite subset of ui, so it has order type u>. The embedding set and the operations and 
relations will be built up in several steps. 

To start with, let Si be the set of non-identity embeddings in the given algebra. As 
noted before, this set is closed under •. Now the following properties are true of Si and O: 

• The left distributive law holds. 

• P < 7 implies a(3 < 07. 

• 0(7) > 7. 

• a(cr(a)) > cr(a). 

• 0(7) = 7 for 7 < cr(a). 

• cr(a6) = a(cr(6)). 

We also have the property 

• ab(aj) = 0(67), 

since every ordinal 7 is a critical point and 

ab(a(cr(c))) = ab(cr(ac)) = cr(ab(ac)) = cr(a(bc)) = a(cr(bc)) = a(6(cr(c))). 

Now use the construction from Proposition 2.10 to extend and expand (Si,-) to an 
algebra (£2,-,°) satisfying Laver's laws (LL). The application operation on these new 
embeddings is defined naturally: each embedding a is a formal composition (ai o • • • o a n ) 
of members of Si, and we let 0(7) = a,i(a,2(- ■ ■ a n d) ■ ■ ■ ))■ We have a,iai+i(ai(6)) = 
ai(ai+i(5)) for any 5, so replacing o a$+i with aiai + i o ai in the formal composition does 
not change the resulting value of 0(7); since formal compositions were identified only when 
one could transform one into the other by such replacements and/or the reverse, the value 
ad) is well-defined. Also, let cr(a) be the minimum of cr(ai), . . . , cr(a n ); this is the least 7 
such that 0(7) > 7, so it also does not depend on the expression for a. Then we have: 

• (LL) holds. 

• (a o 6)7 = a(b r y). 

• cr(a ob) = min(cr(a), cr(b)). 

And the properties listed before hold for £2 as well. 

Let S be S 2 U {id}, where id is a new embedding for which cr(id) is not defined but the 
other operations are defined by: 

• id(7) = 7, a ■ id = id, and id • a = a o id = id o a = a. 

Again the previous properties continue to hold. Now it only remains to define a = 7 b so 
that the rest of the axioms in Definition 5.3 hold. 

Lemma 5.6. Assume the facts listed above. Let a, bi, ... , bk be embeddings, where k > 0, 
and let 7 be an ordinal. 
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(i) // cr(a) > &1&2 • • • hi, then ab x b 2 • • • hi = hh • • • hi- 

(ii) If cr(a) > a6i6 2 ■ ■ ■ hi, then ahb 2 ■ ■ ■ hi = hb 2 ■ ■ ■ hi- 

Proof. These are both proved by induction on k (simultaneously for all embeddings). Let 
us write (i m ) for the case k = m of (i), and similarly for (ii). Note that the hypotheses of 
(i) and (ii) each imply that cr(a) > 7. 

(10) : This just says that a does not move any ordinal below its critical point. 

(11) : a&i7 = 061(07) = 0(617) = hi- 

(ifc) for k > 2: Let s = aha. Note that 5(06162) = 0610(06162) = 061(062) = 0(6162)- 
Also note that cr(s) = o6i(cr(o)) > cr(a); similarly, cr(ws) > cr(a) for any w. In particular, 

cr(6i6 2 • • • 6 fc _is) > cr(o) > 6i6 2 • • • 6^7, 

so (ii) gives 

6i6 2 • • • b k -i(sb k )i = 6i6 2 • • • 6 fc _is(6i6 2 • • • b k -ib k )i = 6i6 2 • • • b k -ib k l- 
We now have 

cr(6i6 2 • • • h- 2 s) > cr(o) > 6 X 6 2 • • • b k -i(sb k )l, 
so, if k > 2, we can apply (i 2 ) to get 

6162 • • •6fc- 2 (s6fc-i)(s6 fc )7 = 6i6 2 • • •6 fc _2s(6i6 2 • • • b k - 2 b k -\){sh)l 

= hb 2 ■ ■ ■ b k -\{sb k )~i = 6i6 2 • • • 6 fc _i6fc7. 

We can now apply (13) to 6162 • • • b k ss, and so on all the way to (U-2), to get 

6i6 2 (s6 3 )(s6 4 ) • • • (s6 fc )7 = 6i6 2 • • -6^7. 

Now we have 

s(a6i6 2 • • • 6^7) = s{ahb 2 ) (s6 3 ) (s6 4 ) • • • (sb k ) (57) 
= a(6i6 2 )(s6 3 )(s6 4 ) • • • {sb k )^ 
= 6i6 2 (s6 3 )(s6 4 ) • • • (s6 fc )7 by (i fe _i) 

= 6i6 2 • • -6fc7 
= s(6i6 2 • • -6fc7). 

Since s maps distinct ordinals to distinct ordinals, we get 06162 • • • 6fc7 = b\b 2 - ■ ■ h^. 
(ho): We have cr(a) > 07 > 7, so (io) applies. 

(hi): 0617 = 061(07) = 0(617) ^ hi-, so cr(a) > 617, so (ii) applies, 
(ii/c) for k > 2: Again let s = aha. We now have 

cr(ws) > cr(s) > cr(a) > 06162 • • -6fc7 
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for any w. This gives 



abib 2 



bkl = s(abib 2 ■ • -6fc7) 

= s(ab 1 b 2 )(sb 3 )(sb 4 ) ■ ■ ■ (sb k )(s^y) 

= a(b 1 b 2 )(sb 3 )(sb 4 ) ■ ■ ■ (sb k h 

= b 1 b 2 (sb 3 )(sb 4 ) ■ ■ ■ (s6 fc )7 

= b 1 b 2 s(b 1 b 2 b 3 )(sb 4 ) ■ ■ ■ (sb k )^ 

= hb^isb^ ■ ■ ■ (s6 fc )7 

= b 1 b 2 b 3 s(b 1 b 2 b 3 b 4 )(sb 5 ) ■ ■ ■ (sb k )^ 

= 61626364(865) ■ ■ ■ (s6 fc )7 



by (iifc-i) 



by (iifc-2) 



by (iifc-3) 



= bib 2 ■ ■ ■ 



hi, 



by (iii) 



as desired. 



Define the preliminary relation ~ 7 between embeddings as follows: a ~ 7 b if, for each 
k > and all embeddings c\, . . . , Cf~, 



where a ) 7 is a \ {(3: a{j3) < 7}. In other words, a ~ 7 b iff, for any 5, if either ac\ • • • c k S 
or bc\ ■ ■ ■ c k S is less than 7, then ac\ ■ ■ -CkS = bc\ • • -c^S. This is easily seen to be an 



We can now define the final desired relation = 7 by: a = 7 b iff ra ~ r7 rb for all 
embeddings r (including r = id). This is also an equivalence relation. Since cr (ra) = 



Easily, if 7 < 8, then a ~ b implies a ~ 7 6, and the same holds for =. 
It follows immediately from the definitions of = 7 (with r = id) and ~ 7 (with k = 0) 
that, if a = 7 b and a5 < 7, then a5 = b5. 

If a = 7 a' and b = 7 b', then we have already shown that ab = ai ab' , so ab = 7 ab' . Also, 
r(ab)ci • ■ ■ Ck = ra{rb)c\ ■ ■ ■ Ck and r(a'b)ci • • • Ck = ra'{rb)c\ ■ • ■ Ck, so from a = 7 a' we get 
ab = 7 a'b. Similarly, we get (a o b) = 7 (a' o b) since r(a o b)ci ■ • • Ck = ra(rbci)c 2 ■ • • and 
the same for a' . (For the case k = 0, note that, if r(a o 6)5 < r7, then ra(rb5) < v-f, so 
ra(rb5) = ra'(rb5), so r(a o 6)5 = r(a ; o b)5.) Now, using the formulas a o 6 = ab o a and 
a 06' = ab' oa, we get (a 06) = 7 (ao6 ; ). So the equivalence relation = 7 respects application 
and composition of embeddings. 

Therefore, we have a two-sorted embedding algebra. ■ 

If the original embedding algebra satisfies the property ab(a(n)) = a(b(n)) for all em- 
beddings a, 6 and natural numbers n, then one can let O be the entire set u, rather than 
just the critical points, and the construction will work as before. As a result, one sees that 



aci • • • Ck 1 7 = bet ■ ■ ■ c k 1 7, 



equivalence relation, and Lemma 5.6 just states that a 



~ cr O) id. 
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the two-sorted embedding algebra includes an 'isomorphic' copy of the original embedding 
algebra, expressed in two-sorted form. [In order to see that deleting id and reinserting it 
later does not cause a problem, we must show that the new formulas for multiplying by id 
match the old ones. In other words, we must see that, if the original embedding algebra 
contained id, then it satisfied id ■ a = a and a ■ id = id. To see this, use the property above 
to get, for all n, 

(id • a)(n) = (id • a)(id(n)) = id(a(n)) = o(n) 

and 

(a • id)(a(n)) = a(id(n)) = a(n) = id(a(n)). 

So id • a = a, and a ■ id agrees with id at all numbers of the form a(n); but the only strictly 
increasing function from w to w which agrees with id at infinitely many places is id.] 

It is easy to see that, if the original embedding algebra is monogenic, then so is the 
two-sorted embedding algebra constructed above. 

We conclude this section with a proposition about two-sorted embedding algebras which 
is a substitute for Kunen's theorem about elementary embeddings. 

For any non-identity embedding a, the sequence cr(a), a(cr(a)), a(a(cr(a))), . . . is a 
strictly increasing sequence of ordinals, called the critical sequence of a. 

Proposition 5.7. In any monogenic two-sorted embedding algebra, if a 7^ id is an embed- 
ding, then the critical sequence of a is cofinal in the set of critical points (the range of cr). 
Also, 0(7) > 7 for any critical point 7 > cr(a). 

Proof. All members of the critical sequence are critical points (of the embeddings a, aa, 
a(aa), a(a(aa)), etc.). Let j be a non-identity embedding which generates the algebra, 
and let (n n : n G uj) be the critical sequence of j. We recall Lemma 2.15. It was stated for 
elementary embeddings, but the proof clearly works in the present context as well. Thus 
every a must move some ordinal K n , and hence cr(a) < K n ; this shows that the critical 
sequence of j is cofinal in the critical points. To complete the proof of the first claim, we 
now show by induction on expressions in j that, if a 7^ id and (a n : n G u) is the critical 
sequence of a, then a n > K n for all n. This is again trivial for a = j. Suppose it is true for 
b and c, with critical sequences ((3 n : n G u) and (7^: new) respectively. If a = be, then 
induction gives a n = lry n for all n, so a n = b^ n > 7 n > K n . If a = b o c, then «o is either 
flo or 7q. In the former case, the fact that a n+ i = b(ca n ) > ba n gives a n > (3 n for all n; 
similarly, in the latter case, we have a n > 7 n for all n. In either case, we get a n > K n , as 
desired. 

Now, if 7 > cr(a) is a critical point, then 7 > ao and 7 < a m for some m, so there is 
an n such that a n < 7 < a n +i. This gives 07 > aa n = a n+ i > 7. ■ 

6. Extended two-sorted embedding algebras 

In order to prove Theorem 5.2, we will need to perform a number of the arguments 
of Laver [7] in the context of two-sorted embedding algebras. This is straightforward for 
arguments involving only the operations which are built into these algebras, but some ar- 
guments use additional features of elementary embeddings. In particular, a few arguments 
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use ordinals of the form a(<7), defined to be the least ordinal greater than a{f3) for all 
j3 < 7. In this section, we will define an extended algebra which includes this operation and 
show that such algebras can be constructed from ordinary two-sorted embedding algebras; 
this will allow us to use this new operation to prove facts about the original algebra. 

Definition 6.1. An extended two-sorted embedding algebra is a two-sorted embedding 
algebra (with embedding set S and ordinal set O), together with two new operations, a 
cofinality function cf : O — > O and a mapping from £ x O to O for which we use the notation 
a, 7 I— > a(<7), satisfying the following additional axioms: 

a(6(<7)) = a6(<a 7 ); 
a(<6(< 7 )) = (oo6)(< 7 ); 
a(<7) < a7; 
if 7 < 5, then a7 < a(<5); 
if a = 7 6 and a(<<5) < 7, then a(<8) = b(<5); 
cf(cr(a)) = cr(a); 
cf(a(<7)) = cf 7; 
cf (07) = a(cf 7); 
cf 7 < 7; 

if a(cf 7) = cf7, then a(<7) = 07. 

The last two of these axioms are not used in this paper, but they might be useful for 
later applications. On the other hand, there are a few facts that are used in this paper 
but not given above, because they can be deduced from the axioms. 

Proposition 6.2. In an extended two-sorted embedding algebra: 

(1) a(< 7 ) > 7; 

(2) id(< 7 ) = 7; 

(3) a(<7) = 7 for 7 < cr(a); and 

(4) ifa(cfj) > cf 7, then a(<7) < 07. 

Proof. For all 5 < 7, we have a(<7) > a<5 > 5; hence, (1) holds. This and id(<7) < id(7) 
give (2); we then get (3) because a = 7 id. For (4), we have a(<7) < 07, and equality 
cannot hold because a(<7) and 07 have different cofinalities. ■ 

Again it is not hard to verify that the axioms for an extended two-sorted embedding 
algebra hold in the case where S is a set of elementary embeddings on V\ and O is the 
collection of limit ordinals less than A [7]. Also, any subalgebra of an extended two- 
sorted embedding algebra is also an extended two-sorted embedding algebra; in particular, 
if we keep the same set of embeddings but restrict the ordinals to those of the form 
a(<cr(6)), we get an algebra in which all ordinals have this form. (Proposition 6.2(3) gives 
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cr(o) = a(<cr(a)), so all critical points are in this set of ordinals; now the axioms easily 
imply that this set of ordinals is closed under all of the algebra operations.) 
We now state the main result of this section. 

Theorem 6.3. Suppose that we are given a two-sorted embedding algebra, in which every 
ordinal is a critical point. Then the algebra can be extended to a new two-sorted embedding 
algebra with the same embedding set, on which the required additional operations can be 
defined so as to give an extended two-sorted embedding algebra. 

The proof of this theorem will use the following two lemmas about two-sorted embedding 
algebras. 

Lemma 6.4. In any two-sorted embedding algebra, if 7 = cr(c), then: 

(a) cc(ca"f) < c(ca"f); 

(b) 007 is not in the range of c. 

Proof. For (a), note that cr (cc) = 07 > 7, so CC7 = 7; hence, 

c(ca"f) = cc(ca)(cj) > cc^ca)^ = cc(ca)(ccy) = 00(007). 

On the other hand, an element 5 of the range of c cannot satisfy ccd < c5; if 6 = c(3, then 
c5 = cc(c/3) = cc5. Therefore, (b) holds. ■ 

Lemma 6.5. In any two-sorted embedding algebra, if cr(r) = cr(s) = k, then rX < ran 
implies sX < san. 

Proof. Assume rX < ran. Note that cr(rs) = vk > k, so rsn = k; this gives 

r(s\) = rs(rX) < rs{ran) = rs(ra)(rsK,) = rs(ra)K < rs(ra)(rK) = r(saK). 

Since r gives an increasing function on the ordinals, we must have sX < san. ■ 

Proof of Theorem 6.3. Fix a two-sorted embedding algebra. Let £ and O be its embedding 
set and ordinal set, respectively, and assume that the range of cr is all of O. We must 
extend O to a larger collection of ordinals on which the operation a(<7) can be suitably 
defined. The remarks following Proposition 6.2 indicate that this new set of ordinals need 
only contain the ordinals a(<cr(6)) for a, 6 G S. The main step will be to define the 
linear ordering properly for such ordinals; it turns out that the properties of an extended 
two-sorted embedding algebra determine this ordering completely. 

Lemma 6.6. In an extended two-sorted embedding algebra, if 7 = cr(c) and 5 is any 
ordinal, then 

a(<7) < 5 ^=^> ca7 < cS. 
Proof. If a(<7) < 5, then the fact that 07 > 7 gives 

007 < ca(<c7) = c(a(< i y)) < cS. 
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On the other hand, if 5 < a(<7), then we can use c(<7) = 7 to get 

c5 < c(<a(<7)) = (c o a)(<7) = (ca o c)(<7) = ca(<c(<7)) = ca(<7) < car/. 

■ 

It follows that, if 7 = cr(c) and 5 = cr(<i), then 

a(<7) < b(<6) ca7 < c(6(<5)) 

•<=>- c6(<c5) ^ ca7 

-<=>- cd(cb)(c5) ft cd(ca'y) 

•<=>- C(i(ca7) < c(dbS). 

This tells us how to start the construction from the given two-sorted embedding algebra. 
We want to define a binary relation R on £ x O as follows: 

(a, 7)i?(6, 5) -<=>- cd(ca7) < c(dbS), 

where c and <i are chosen so that cr(c) = 7 and cr(<i) = 5. Such c and d do exist because 
every element of O is a critical point; we must now see that the definition of R does not 
depend on which c and d are chosen. If c' also has critical point 7, then Lemma 6.5 gives 

c(d o a) 7 < c(db5) c'(d o a) 7 < c'(eM), 

so cd(ca'j) < c(db5) iff c'd(c'a'j) < c'(db5). Also, if cf is another embedding with critical 
point 5, then cr(cd) = cr(ccf) = c5, so Lemma 6.5 gives 

cd(ca^) < cd(cb)(c5) -<=/- cd 1 '(007) < cd' (cb)(c5). 

Note that, by Lemma 6.4(b), c(<i o 0)7 < c(dbd) is equivalent to c(<i o a)7 < c(dbd), so 
(a,7)i?(6, 5) iff c<i(ca7) < c(dbd). Therefore, is well-defined. 

Lemma 6.4(a) implies that R is reflexive. We will now show that R is transitive. Suppose 
(a, p)R(b, a) and (6, a)R(c, r); fix embeddings r, s, t with critical points p, a, r, respectively. 
We then have rs{rap) < r(sba) and st(sba) < s(ter), so 

rs(rt(rap)) = rs(rt)(rs(rap)) 

< rs(rt)(r(sba)) 
= r(st(sba)) 

< r(s(tcr)) 
= rs(r(tcr)), 

so rt(rap) < r(tcr), so (a, p)R{c, r). The same proof using > instead of < shows that the 
negation of R is also transitive. 
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We now know that R is a preorder; if we define the relation ~ on £ x O by 



(a, 7) ~ (b, 6) 



(a,^)R(b,S) and (b, S)R(a, 7), 



then ~ is an equivalence relation on £ x (9 and R induces a partial order on the set of 
equivalence classes. Let O* be the set of equivalence classes; we will write [a, 7] for the 
equivalence class of (0,7). Let <* be the partial ordering induced by R on O* . We then 
have 



where cr(c) = 7 and cr(<i) = 5. The fact that the negation of R is transitive implies that 
any two elements of £ x O are .R-comparable (if xRy and yRx were both false, then xRx 
would be false, contradicting reflexivity), so <* is a linear ordering of O*. 

The various distributive laws imply that, for any e G £, we have (a, 7)i?(6, 5) if and 
only if (ea, e^)R{eb 1 ed). Therefore, e induces a mapping from O* to O* via the formula 
e[a, 7] = [ea, e7], and this mapping is strictly increasing. Also, we clearly have (eoe ; ) [a, 7] = 
e(e'[a,7]). 

The element [a, 7] of (9* is meant to represent a(<7) in an extended algebra. For this to 
extend the original algebra, we need an element H('j) of O* to correspond to each 7 6 O. 
This element will turn out to be [0,7], where c is any embedding with critical point 7. In 
order to see that this is well-defined and gives the proper ordering on the representatives 
in 0*, we need the following result. 

Lemma 6.7. // cr(c) = 7 and cr(d) = 5, then [c, 7] <* [d, 5] if and only if 7 < S. 

Proof. By definition, [0,7] <* [d,S] if and only if cd(cc^) < c(ddd). But cr(cc) > 7 and 
cr(dd) > 5, so this is equivalent to 007 < c5. Now, if 7 = 5, then cr(cd) = 07 > 7, so 
cd7 = 7 = 5 < c5. If 7 > S, then 007 > 7 > 5 = c<5, so [0,7] ^* [d, 5], so [0,7] >* [d, 5]. 
Symmetrically, if 7 < 5, then [0,7] <* [d, <5]. ■ 

So the correspondence between 7 and [c, 7] gives an order- preserving map H:0 — > (9*. 
This lets us define the new critical point map cr*:£ — > (9* by the formula cr*(c) = 
H(cr(c)) = [c, cr(c)]. 

We next verify that the embedding maps 7* e7* satisfy e[a,7] >* [0,7]. We must 
show that c(ec)(ca"f) < c(ec(ea)(e^)), where cr(c) = 7; to see this, note that 





c(ec)(ca"f) = ce(cc)(ca"f) 



< ce(cc)(ce(ca'-y)) 
= ce(cc(ca"f)) 

< ce(c(ca7)) 
= c(e(ca7)) 

= c(ec(ea)(e7)). 



by Lemma 6.4(a) 
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Clearly e(cr*(a)) = cr*(ea); since cr(aa) = a(cr(a)) > cr(a), this gives a(cr*(a)) = 
cr* (aa) > cr*(a). 

Next, we define the new ternary relation =* as follows: a =* 7 b iS a = s b for some 
<5 G such that 7* <* H(8). In other words, a agrees with b up to some new ordinal iff 
a agrees with b up to some old ordinal at least as high. Using this definition, it is easy to 
deduce all of the axioms about =* from the corresponding axioms about =, except for the 
axiom "if a =* 7 b and aS* < 7*, then a8* = b8* n ; this one will require more work. 

If cr(d) = 8, then H(8) = [d, 8] <* [a, 8] for any a, because dd(dd8) = 8 < d(da8). 

Lemma 6.8. // cr(c) = 7, then [0,7] <* H(5) if and only if cary < c8. 

Proof. Fix d with critical point 8; then [0,7] <* [d,S] is equivalent to cd(ca^) < c(dd8) = 
c8. It is clear that cd(ca^) < c8 implies caj < c8, because 007 < cd(ca^). On the other 
hand, if cci'j < c8, then ca^y < cr(c<f), so cd(ca'j) = ca^y < c5. ■ 

Lemma 6.9. // a = 7 b and [a, 5] <* H{^), then [a, 5] = [b,5]. 

Proof. It is enough to show that [b,5] <* [a, 8], since then one can interchange a and b. 
Fix d such that cr(<i) = 6. By the preceding lemma, we have daS < d^y. This allows us to 
conclude from da = dl db that da5 = dbS; since Lemma 6.4(a) gives dd(dbS) < d(db5), we 
get dd(db5) < d(da5), so [b,5] <* [a, 8], as desired. ■ 

We are now ready to prove the remaining property of =*: if a =* 7 b and o[c, p] <* 7*, 
then a[c,p] = b[c, p]. Fix 8 such that 7* <* H{8) and a = 5 b. We have [ac,ap] <* if(<5), 
so the statement preceding Lemma 6.8 gives H(ap) < H(8). Since H is order-preserving, 
we have ap < 8. Therefore, ap = bp, so, using ac = 5 be and Lemma 6.9, we get a[c, p] = 
[ac, ap] = [be, ap] = [be, bp] — b[c, p]. 

We have now completed the proof that £ and O*, together with the starred operations 
and relations, form a two-sorted embedding algebra. Also, we have a canonical order- 
preserving map H from O to O*, and it is easy to check that H sends all of the operations 
and relations to their starred equivalents; hence, (£, O) is isomorphic to a subalgebra of 
(£,0*), so (£,0*) is isomorphic to an extension of (£,0). It now remains to define the 
additional operations of an extended two-sorted embedding algebra for {£ , O*). 

Since we want the pair [6,7] to represent 6(<7), the formula a(<6(<7)) = (a o 6)(<7) 
indicates that we should define a(<[b, 7]) to be [006,7]. The fact that this is a valid 
definition (i.e., it does not depend on the choice of a representative (6, 7) for the equivalence 
class [6, 7] ) follows from the next lemma. 

Lemma 6.10. // [6,7] <* [6', 7'], then [aob, 7] <* [aob',Y]. 

Proof. Fix c and c' such that cr(c) = 7 and cr(c') = 7'. Since [6,7] <* [6', 7'], we have 
cc'(cby) < c(c'b'Y); applying c(c'a) to this gives c(c'a)(cc' (067)) < c(c'a)(c(c'b'Y)). But 

c(c'a)(cc'(cb^)) = cc' (ca)(cc' '(067)) = cc'(ca(c&7)) = cc'(c(a o 6)7) 

and c(c'o)(c(c'6 / 7 / )) = c(c / a(c / 6 / 7 / )) = c(c' (aob')^'), so we have cc' (c(aob)'j) < c(c' (aob')'j') 
and hence [006,7] <* [aob' ,1']. ■ 

So a(<[6, 7]) is well-defined. The next lemma shows that this definition matches the 
original motivation. 
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Lemma 6.11. For all a and 7, a{<H{^y)) = [a, 7]. 

Proof. Fix c such that cr(c) = 7; then a{<H{pf)) = [aoc, 7]. We have cr(ac) = 07, so 
ac = a7 id, so a = a7 ac o a = a o a From 7 < 07, we get 007 < 02(07) = c ( a 7)? so 
Lemma 6.8 gives [0,7] <* H(a^). Therefore, Lemma 6.9 gives [0,7] = [aoc, 7], as desired. 

■ 

We now verify that this definition of a(<7*) satisfies the first five axioms listed in 
Definition 6.1. Let 7* = [c, p] and S* = [d,a\. The first two axioms are proved by simple 
computations: 



a(6(<7*)) = a[6oc, p] = [aboac, ap] = ab(<[ac,ap\) = a6(<a7*), 
a(<6(<7*)) = a(<[boc,p}) = [ao&o C , p] = (ao6)(< 7 *). 

The next two axioms are equivalent to: 7* <* 8* if and only if a(<7*) <* ad*. To prove 
this, fix r and s such that cr(r) = p and cr(s) = a; then 

a (<7*) <* -<=>- [aoc, p] <* [ad,aa] 

-<=>- r(as)(r(ooc)p) < r(as(ad)(acr) 

-<=>- ra(rs)(ra(rcp)) < r(a(s<icr)) 

-<=>- ra(rs(rcp)) < ra(r(sda)) 

-<=>- rs(rcp) < r(sda) 

<^ 7* <* 5*. 



For the fifth axiom, suppose a =* 7 6 and a(<5*) <* 7*. Find 7/ such that a b 
and 7* <* H(rj); then a(<5*) <* H{rj) and a o d = v b o d, so Lemma 6.9 gives a(<5*) = 
[aod, a] = [6od, a] = b(<5*). 

It remains to find a suitable definition for the cofinality function. Since [a, 7] is supposed 
to represent a(<7), where 7 is a critical point and hence regular, we define cf [0,7] to be 
#(7). As usual, we need a lemma showing that this does not depend on the choice of a 
representative for the equivalence class [a, 7] . 

Lemma 6.12. If 7 7^ 8, then [0,7] 7^ [6,5]. 

Proof. We may assume 7 < 6. Fix c and d such that cr(c) = 7 and cr(d) = 5; then cr(dc) = 
07 = 7 and cr(dcd) = dc5 > 5 > 7. We can use dc instead of c when comparing [a, 7] with 
[6,5]: [a, 7] <* [6,5] iff dcd(dca'j) < dc(db5). Now the assumption that [0,7] = [6,5] leads 
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to a contradiction as follows: 



dcd(dca"f) < dc(dbd) 
< d(ca"f) 
= d{ca){d r y) 
= dc(da) , y 
= dcd(dca) (dcd^) 
= dcd(dca"f). 



since [a, 7] <* [6, 8] 
since [6, 5] <* [a, 7] 



It is now trivial to verify the axioms cf(cr*(a)) = cr*(a), cf(a(<7*)) = cf7*, and 
cf(a7*) = a(cf 7*). 



The last two axioms can actually be deduced from the other axioms when the ordinal 7 
is of the form b(<5) where S is a critical point; since every element of O* has this form, this 
will suffice here. The law cf 7 < 7 follows from Proposition 6.2(1), since cf 7 = cf 5 = 5. 
Now suppose a does not move 5 = cf7; then a(<b(<5)) = (a o b)(<5) = (ab o a)(<5) = 
ab(<a(<5)) and a(b(<5)) = ab(<a5), and these two ordinals are equal because 5 < a(<5) < 
ad = 5. 

This completes the proof of Theorem 6.3. ■ 

Theorem 6.3 can be used to transfer various arguments from the context of elementary 
embeddings to that of two-sorted embedding algebras. One example is the following result, 
which Laver proved for elementary embeddings (Theorem 2.14). 

In a two-sorted embedding algebra, let j 7^ id be some embedding, and let Aj be the 
set of embeddings generated from j by the operation • (so each a G Aj is given by a word 



Theorem 6.13. Assume that the set of all critical points of elements of Aj has order 
type ui. If a and b are distinct elements of Aj, then there is a critical point 7 such that 
a( 7 ) ^ 6(7). 

Proof. We may assume that all ordinals in the algebra are critical points; otherwise, just 
move to the subalgebra comprising all embeddings and all critical points. Apply Theo- 
rem 6.3 to construct an extended two-sorted embedding algebra which is an extension of 
the given algebra. We now follow the proof of Theorem 13 from Laver [7]; every step ex- 
cept one in this proof uses only properties of the extended ordinals which are listed in 6.1 
and 6.2, and hence works in the same way here. The one exception is the use of the fact 
that a certain increasing sequence of critical points is cofinal in the set of all critical points 
of Aj; we have made this fact an assumption of the theorem. The result is that, in the 
extended algebra, there exists a critical point 7 such that 07 7^ 67. But all critical points 
in the extended algebra are critical points in the original algebra (since the same holds for 
embeddings), so so we have the desired result in the original algebra. ■ 
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7. Construction of an embedding algebra 

In this section, we will prove one direction of Theorem 5.2 by showing how to con- 
struct an embedding algebra under the assumption that is free (and hence all of the 
statements in Theorem 4.4 hold). 

We will first construct a two-sorted embedding algebra. The embedding set £ will be 
-Poo U {0}, while the ordinal set O will be u. The operations • and o on £ will of course be 
those obtained from Poo, and will be the identity in £. 

We note that 4.4(vi) implies the stronger statement that, for every a G Wp, there is 
an n such that [a] n ^ 0. To see this, use Lemma 2.11 to find a word in Wp of the form 
a\ o ■ ■ ■ o a,k (ai, . . . , afc G Wj) which is equivalent to a. By 4.4(vi), there exists n so large 
that [ai] n 7^ for all i; then formula (3.3) implies that [a] n ^ 0. 

For each a G Wp, define the function e a :u — > u> as follows: for each n G u, let 
e a (n) = s(a ■ 2 n ). In other words, e a (n) is the largest m such that [a ■ 2 n ] m = 0. (By the 
strengthened 4.4(vi), there is a largest such m for each n.) If a = b in P^, then [a] m = [b] m 
and [a • 2 n ] m = [a] m * [2 n ] m = [b] m * [2 n ] m = [b ■ 2 n ] m for all n and m; hence, e a = e b . 
It therefore makes sense to write e a for a G Poo- This will give the desired application 
function from £ x O to O, so we will sometimes write a{n) for e a (n) (but not an, as this 
might be confused with a ■ n). Define eo to be the identity function on ui. 

For any a G P^, we can apply Proposition 3.2 to show that, if [a ■ 2 n ] m = 0, then 
[a ■ 2 n+1 ] m _|_i = 0; it follows that the function e a is strictly increasing. (This is obviously 
true for eo as well.) Now induction gives e a (n) > n for all n. 

Next, we prove that e aQ 6 = e a oe& (i.e., the algebra operation o represents composition). 
This follows from the corollary to Lemma 4.3: 

e aob (n) = s((a o b) ■ 2 n ) = s(a ■ (b ■ 2 n )) = s(a ■ T^) = s(a ■ 2 e ^) = e a (e b (n)). 

For a G Poo, define cr(a) to be the largest m such that [a] m = 0, as given by the 
strengthened 4.4(vi). (We will see later that this is the critical point of e a .) It follows that 
[a]m +1 = 2 m , so [a-2 m ]m+i = [2 m -2 m ] m+ i = 0. (For the last equality, see Proposition 3.2.) 
This proves that a(cr(a)) > cr(a). 

We now define = N for N e O by: a = N b iff [a] N = [b] N . The fact that the algebra P N 
satisfies (LL) immediately implies most of the desired properties of = N . In particular, if 
a = N b and a(m) < N, then [a-2 m ]Ar is nonzero, and [b-2 m ]isr must have the same nonzero 
value, so we find that a(m) = b(m). The only remaining property that is nontrivial is 
coherence, for which we argue as follows. Suppose [a] at = [b] n and M = e c (N); we must 
show that [co\m = [c6]m- The definition of M implies that [c-2 n ]m = 0, so the period of c 
in P M divides 2^. But [a] at = [6]iV) so [o\m and [6]m are congruent modulo 2^; therefore, 
[cci]m = [c6]m, as desired. 

This completes the construction of the two-sorted embedding algebra. The point of con- 
structing this intermediate algebra is that it allows us to apply Theorem 6.13 to conclude 
that, if a ^ b in A^, then e a ^ e&. 

We now construct an embedding algebra as follows. Let A = { e a : cl G Aoo}- Define the 
operation • on A by the formula e a ■ e b = e ab ; this definition is valid because the mapping 
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from a to e a is one-to-one. It is clear that A is generated from the single function e\ by 
the operation •. 

Proposition 5.4(1) implies that the critical point of e a is equal to the number cr(a) 
defined above. Given this, it is easy to see that A satisfies the axioms of an an embedding 
algebra by using the corresponding properties of the two-sorted embedding algebra. This 
completes the construction. 

8. Uniqueness of embedding algebras 

In this section, we will prove the following uniqueness result for monogenic embedding 
algebras. 

Theorem 8.1. (a) If (A, •) is a monogenic embedding algebra for which every natural 
number is a critical point, then (A, •) is isomorphic to the embedding algebra constructed 
from Poo in the preceding section. 

(b) If (£, O; •, o, . . . ) is a monogenic two-sorted embedding algebra in which the ordinals 
have order type u> and every ordinal is a critical point, then it is isomorphic to the two- 
sorted embedding algebra constructed from P^ in the preceding section. 

Along the way, we will show that, if a nontrivial embedding algebra (or a nontrivial 
two-sorted embedding algebra with ordinals of order type u) exists, then 4.4(vi) holds, 
and hence A^ is free, thus completing the proof of Theorem 5.2. Most of the arguments 
in this section are adapted from Laver [7]. 

If an embedding algebra satisfies the hypotheses of Theorem 8.1(a), then, as noted after 
the proof of Proposition 5.5, we can expand/extend it to a two-sorted embedding algebra 
as hypothesized in Theorem 8.1(b). So let us assume we have such a two-sorted embedding 
algebra. Let j be the generating embedding, and let Aj be the set of embeddings generated 
from j using • alone. As noted in section 5, the set of non-identity embeddings is closed 
under •, so every element of Aj has a critical point. For any a G Wj[, let j a be the result of 
replacing each 1 in the expression a with j. (Note that Aj = {j a : a G VF4}.) In particular, 
since we identified positive integers with words in Wj±, we have an embedding j m for each 
m > 0, and j\ = j; also, we let jo = id. 

Let 7 n be the critical point of j2». Recall that, for any a G [a] n is defined to be 
the result of evaluating a in A n = {0, 1, . . . , 2 n — 1}. 

Proposition 8.2. For any a G W\, j a = 7 ™ j [a]n . 

Proof. Since ji n = 7 ™ id = jo, it does not matter whether we work with A n or A' n = 
{1, . . . , 2 n }. Clearly the proposition holds for a = 0. We will show that, for any 6, c G A' n , 
j b j c = ln jb* nC 'i given this, an easy induction on a G yields the proposition. 

The proof of j b j c = 7 ™ jb* n c is by double induction, downward on b and upward on c. 
For b = 2 n , we have j2 n 3c = ln id • j c = 3c = J2™*„c- The case b < 2 n , c = 1 is also trivial: 
jbji = jb+i = jb* n i- Finally, for 6, c < 2 n , 

jbjc+l = jb(jcj) = (jbjc)(jbj) = ln jb* n cjb+l = ln j{b* n c)* n {b+l) = 3b* n (c+l)- 
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(The induction hypothesis can be used in the second-to-last step because b* n c> b.) This 
completes the induction. ■ 

Proposition 8.3. For all n, 7 n < 7 n +i/ also, for all m > 0, cr(j m ) = 7^ where 2 k is the 
largest power of 2 dividing m. 

Proof. By induction on N, we show that these statements are true for n < N and m < 2 N . 
The case N = is vacuous. Suppose now that the assertion is true for N; we will 
prove it for N + 1. We know that 70 < 71 < • • • < 7/v- By definition, cr(j m ) = 77V 
if m = 2 N . If 2 N < m < 2 N+1 , then Proposition 8.2 implies that j m = lN j m -2 N i so, 
if 2 k is the largest power of 2 dividing m — 2^, then 2 k is also the largest power of 2 
dividing m, and cr(j m _ 2 iv) = 7fc < 7 at, so cr(j m ) = 7/5. This means that the embeddings 
j m for 2 N < m < 2 N+1 all have critical points below 77V, and hence, by Proposition 5.7, 
3m{lN) > 7jv! let 6 > he the least of these values j m {lN)- Now coherence gives 
jm+i = Jmj = 6 jmih^j) for all m in this range, so 

32 N +! = J2 N + 1 -lU2 N J) 

= e j2^+i-2(j2~j)(j2~i) 
—6 

= 32 N +i{h N 3) ■ ■ ■ (h N 3) = 32 N 32 N - 



Since cr(j 2 Nj 2 N) = ]2 n {in) > 7tv and 7/v < 0, we must have cr^^+i) > 7jv- This 
completes the induction. ■ 

We can now show that 4.4(vi) holds, and hence Aoo is free, as follows: Suppose a G Wj^. 
Since the sequence of critical points 7 n is strictly increasing, and the ordinals have order 
type u, there must be an n such that cr^'o) < 7 n . Then j a id = jo, so we must have 
[a] n 7^ 0. This completes the proof of Theorem 5.2. 

Proposition 8.3 implies that j m id for 1 < m < 2 n (because cr(j m ) < 7 n ). Con- 
sequently, we have j m j m > for 1 < m < m' < 2 n ; if this were not so, then one could 
apply j m and j m ' to j 2 n — m' times to get j m +2 n -m' = 7 ™ J2 n = ln id, a contradiction. 

It follows that the mapping j a /= ln 1— > [a] n from Aj/= ln to A n is bijective and preserves 
the operation •, so it is an isomorphism. These mappings commute with the canonical 
projections from Aj/= Jn+1 to Aj/= ln and from A n+ i to A n , so they give a mapping from 
Aj to the inverse limit of the algebras A n ; clearly this mapping sends the generator of Aj 
to the generator of A^, so we have a mapping / from Aj onto A^. Since / preserves •, 
and since A^ is free, / must be an isomorphism between Aj and A^. 

For any a G if n is so large that cr(j a ) < 7 n , then Proposition 8.2 gives j a = ln j[ a ] n , 
and [a] n must be nonzero, so, by Proposition 8.3, cr(_7 a ) = 7^ where 2 k is the largest power 
of 2 dividing [a] n . This k is just s(a). Also, for any m, we get 



ja{lm) =ja(cr(j 2 -)) = Cr(j a J 2 -) = Cr(j a . 2 m) = 7 s ( a . 2 -) = 7e Q (m), 
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where e a is as defined in section 7. Finally, for any a, b G we have 

ja = ln jb ^ j[a] n = ln 3[b] n ^ [a]b = [b] n - 

Therefore, the structure of Aj is determined completely except for the possible existence 
of ordinals which are not critical points. (Even for these, the equivalence relation = s is 
determined; the argument of the preceding paragraph shows that, if 7 n -i < 5 < 7 n , then 
j a = s jb if and only if [a] n = [b] n -) In the situation of Theorem 8.1(a), there are no such 
extra ordinals, and we have 7 n = n for all n; we can now see that the structure of Aj 
(which is just a copy of the original embedding algebra A) exactly matches the structure 
defined in section 7 from P^. So Theorem 8.1(a) is proved. 

Now, in the situation of Theorem 8.1(b), let Pj be the set of embeddings generated 
from j using both • and o. (Since the algebra is generated by j, this is all embeddings 
except id.) In order to show that composition here matches the structure from section 7, 
we use the following result. 

Proposition 8.4. If a, b G Aj and n G ui, then there is c G Aj such that aob = ln c. 

Proof. We may assume that cr(a) > cr(6); otherwise, replace a and b with ab and a (using 
a o b = ab o a). Now let ao = b, a\ = a, and ai = aj_iaj_2 for % > 2. Induction gives 
aj + i o a,i = a o 6, cr(6) = cr(ao) = cr(a2) = cr(a4) = . . . , and cr(a) = cr(ai) < cr(as) < 
cr(as) < .... Since the sequence cr(a2i+i) is a strictly increasing sequence of critical 
points, and the set of all critical points has order type u> : there must be an odd i such that 
cr(dj) > 7 n ; this gives a o b = a,i o a^_i = 7 ™ a^-i, so we can let c = a^_i. ■ 

It follows that, if a, b G A n , then there is c G A n such that j a o j b = 7 ™ j c - we know from 
the above results that this c is unique. To determine what c is, note that (j a oj b )j = 7 ™ j c j, 
so j a * n (b+i) = 3c+ij so a * n (6 + 1) = c + 1, where the additions are performed modulo 2 n 
in A n ; hence, c = (a * n (b + 1)) — 1 = a o n b. We therefore have j a o j b = 7 ™ j a o n b fc> r 
a, b G A n ; now, if we define j a for a G W-p as we did for a G Wj_, then induction on a 
gives j a = ln j[ a ] n for all a G W-p. We can now argue as before that Pj/= ln is isomorphic 
to P n and Pj is isomorphic to P^, so the structure of Pj is unique except for the possible 
existence of ordinals which are not critical points, and matches that from section 7. This 
completes the proof of Theorem 8.1. 

One can in fact construct an embedding algebra with numbers that are not critical 
points, either by just duplicating every critical point or, less trivially, by constructing the 
extended algebra in section 6 and then using the method of section 7 to convert this to 
an embedding algebra. (One can then modify the algebra further to get an embedding 
algebra which does not satisfy ab(a(n)) = a(b(n)).) For the less trivial construction, one 
must observe that the ordinals in the extended algebra have order type uj. To see this, 
note that if a = ln b and a(<K) < 7 n , then a(<K) = b(<K,); hence, there are at most n2 n 
extended ordinals below 7 n . 

On the other hand, we now have a roundabout proof that, if there is a nontrivial 
embedding algebra, then there is one in which all natural numbers are critical points (and 
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hence ab(a(n)) = a(b(n)) holds), namely the one constructed from P^. One would expect 
to be able to prove this directly, by simply deleting the natural numbers which are not 
critical points and relabeling the critical points as 0, 1, 2, ... . However, it is conceivable 
that distinct functions in the algebra are the same when restricted to the critical points, 
so that • could fail to be well-defined after the other numbers are deleted. It turns out that 
this does not happen in the monogenic case, but the authors do not see a way to prove 
this without building up enough structure to imitate Laver's proof of Theorem 2.14. 

9. The strength of "A^ is free" 

As we recalled in section 1, Laver's proof of the irreflexivity of the free left distribu- 
tive algebra on one generator assumed the existence of a nontrivial elementary embedding 
from V\ to itself; this is an extremely strong large cardinal hypothesis. (Actually, Laver 
had noted that, since one only needs a bounded part of V\ to talk about the finitely many 
embeddings mentioned while comparing two given words in the free algebra, the assump- 
tion can be reduced to the existence of an n-huge cardinal for each natural number n.) 
The possibility that the irreflexivity property was strong enough to require large cardi- 
nal assumptions for its proof remained until Dehornoy proved the property without such 
assumptions (in fact, using only Primitive Recursive Arithmetic). 

We now consider the statement "Aqo is free" and the equivalent versions in Theorem 4.4. 
These statements imply that is both free and irreflexive, so the irreflexivity of the free 
algebra follows immediately. The purpose of this section is to show that the statement 
U A 00 is free" is strictly stronger than the statement "the free algebra is irreflexive," in the 
following sense: 

Theorem 9.1. The statement "A^ is free" is not provable in Primitive Recursive Arith- 
metic. 

Of course, we assume throughout that PRA is itself consistent. 

Proof. It is a well-known result from proof theory (see Sieg [8]) that the only recursive 
functions that can be proved to be total using only PRA are the primitive recursive func- 
tions. Therefore, to prove the theorem, it will suffice to show that PRA + 4.4(vii) proves 
the totality of a recursive function F which is not primitive recursive. 

For each natural number n, let F(n) be the largest m such that [it n ] m = 0, where u n 
is the word 1 • (1 • (. . . (1 • 1) . . . )) with n + 1 l's. It follows from 4.4(vii) that F is a 
total recursive function. If the functions e a are defined as in section 7, thus giving an 
embedding algebra, then F{n) = e™(0), so F is the critical sequence of the mapping e±. 
Since all natural numbers are critical points in this embedding algebra, one can state that 
F(n) is the number of critical points below e"(0). 

We now use the methods of Dougherty [5] for producing many critical points. That 
paper is written in terms of elementary embeddings, but it is not hard to check that the 
only properties used in section 2 of that paper are that each embedding gives a strictly 
increasing monotone function on the ordinals and that, if a and b are two such embeddings, 
then cr(a6) = a(cr(6)) and a(bj) = ab(a^) for all ordinals 7. Hence, the main theorem 
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of that paper, that the number of critical points below K n = j n (Ko) grows so rapidly 
with n that it cannot be primitive recursive, applies to any nontrivial embedding algebra 
or two-sorted embedding algebra. (The results in later sections of that paper use only 
the properties of an extended two-sorted embedding algebra, so the stronger lower bounds 
obtained there also apply to any nontrivial embedding algebra or two-sorted embedding 
algebra.) But, in the embedding algebra from section 7, the number of critical points 
below K n is just F(n) as defined above, so F is not primitive recursive. ■ 

On the other hand, the freeness of follows from the existence of a nontrivial el- 
ementary embedding j:V\ — > V\. The proof of this (due to Laver) uses Theorem 2.13. 
Given this theorem, we can apply the arguments in section 8 to the monogenic two-sorted 
embedding algebra obtained from Pj to conclude that Aoo is free. Laver (personal com- 
munication) has recently noted, and the authors have confirmed, that one can use the 
method of proof of Theorem 2.13 while working with only an n-huge embedding, to get a 
correspondingly weaker result; hence, the freeness of follows from the existence of an 
n-huge cardinal for each natural number n. (There is a level-by-level form of this result: 
if a /c-huge cardinal exists, then there is a natural number n such that [uk] n 7^ 0.) 

The proof of Theorem 9.1 showed that the assumption that A^ is free can be used to 
construct a particular function F which grows too rapidly to be primitive recursive. It 
turns out that one cannot produce any function growing much faster than F from this 
assumption. This can be stated precisely as follows. 

Proposition 9.2. Any recursive function which is provably total in PRA + "A^ is free" 
must grow more slowly than F m for some m, where F = F and F m+ i is the iteration 
of F m (starting at 1, say; that is, F m +i(n) = F^(l)). 

Proof. The proofs in Sieg [8] can be modified to give the following extended version of the 
proof-theoretic result used earlier: 

If P(n,m) is a primitive recursive predicate, f(n) is the least m such that 
P(n, m) holds, and g is a recursive function which is provably total in PRA + 
\/n3mP(n,m), then g can be obtained from / and trivial functions (constants, 
projections, and successor) by composition and primitive recursion. 
The function F can be used as /, since P(n, m) can be defined to be u u n ^ in A m+ i. n 
Also, 4.4(vii) is a consequence of PRA+Vn3mP(n, m). Therefore, any recursive function g 
provably total from PRA + u A oc is free" must be obtainable from F and trivial functions 
by the operations of composition and primitive recursion. Now the standard proof by 
induction on the number of such operations used shows that g is below F n for some n. ■ 

As a particular case of this, recall that there is a primitive recursive algorithm for 
comparing two expressions a and 6, i.e., transforming them into equivalent expressions a' 
and b' such that either a' = b' or one of a', b' is a left subterm of the other. Starting with 
this, one can go through Laver's proof that any two distinct embeddings must differ at 
a critical point, and verify that all of the steps are primitive recursive. Hence, assuming 
is free, if a and b are members of Wj_ such that a b, and n is least such that j a 
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and jb differ at critical point number n, then n can be obtained from a and b by a function 
whose growth rate is comparable to that of F. 

10. Open problems and acknowledgments 

There remain a number of open problems related to these algebras. The main one, of 
course, is the exact strength of the statement u A OQ is free" ; the gap between "more than 
PRA" and "there is an n-huge cardinal for each n" is rather large. One can also ask 
whether "there is a nontrivial two-sorted embedding algebra" is as strong as "there is a 
nontrivial embedding algebra." 

It is still open whether Laver's result on distinguishing elementary embeddings by their 
behavior on critical points (Theorem 2.14) can be extended to Pj. If it can, by methods 
formalizable in an extended two-sorted embedding algebra, then one can define a version 
of embedding algebra which includes a composition operation, and the existence of a 
nontrivial such algebra will still be equivalent to "A^ is free." 

Another area of interest is further extensions of the results in section 6 to include 
more of the ordinals that can be defined from elementary embeddings. (Eventually one 
might hope to start with the embedding algebra obtained from and construct a larger 
structure including all of the important features of the algebra obtained from an elementary 
embedding from V\ to itself.) A natural next step is to try to define ordinals of the form 
"the least a such that a(a) > 7" for a given embedding a and ordinal 7. Such ordinals seem 
to be closely tied to the inequality 00(7) < 0(7): the existence of the ordinals allows one 
to prove that the inequality holds, and the authors can show under the assumption of the 
inequality that there is a natural extension of a given monogenic two-sorted embedding 
algebra in which all ordinals are critical points to an algebra including such ordinals. 
The authors do not yet have a large-cardinal-free proof that the inequality holds in the 
embedding algebra constructed from Aoo, even assuming that Aoo is free. 

The authors would like to thank P. Dehornoy for discussing his work and suggesting 
further questions, R. Laver for showing us his unpublished results, M. Rathjen for con- 
sultations about proof theory, and J. Zapletal for pointing out the construction used in 
Proposition 2.12. 

References 

1. P. Dehornoy, Free distributive groupoids, J. Pure Appl. Algebra 61 (1989), 123-146. 

2. , Sur la structure des gerbes libres, C. R. Acad. Sci. Paris Ser. I Math. 309 (1989), 143-148. 

3. , The adjoint representation of left distributive structures, Comm. in Algebra 20 (1992), 1201- 

1215. 

4. , Braid groups and left distributive operations, Trans. Amer. Math. Soc. 345 (1994), 115-150. 

5. R. Dougherty, Critical points in an algebra of elementary embeddings, Ann. Pure Appl. Logic 65 
(1993), 211-241. 

6. R. Laver, The left distributive law and the freeness of an algebra of elementary embeddings, Adv. 
Math. 91 (1992), 209-231. 

7. , On the algebra of elementary embeddings of a rank into itself, Adv. Math. 110 (1995), 334- 

346. 

8. W. Sieg, Fragments of arithmetic, Ann. Pure Appl. Logic 28 (1985), 33-71. 



FINITE LEFT-DISTRIBUTIVE ALGEBRAS AND EMBEDDING ALGEBRAS 



9. F. Wehrung, Gerbes primitives, C R. Acad. Sci. Paris Ser. I Math. 313 (1991), 357-362. 

Department of Mathematics, Ohio State University, Columbus, OH 43210 
E-mail address: rld@math.ohio-state.edu 

Pennsylvania State University, 215 McAllister Building, University Park, PA 1 
E-mail address: jech@math.psu.edu 



